CDS

Accession Number TCMCG042C81900
gbkey CDS
Protein Id XP_016514036.1
Location complement(join(69202..69263,69599..69636,69793..69881,70218..70294,70379..70426,70515..70634,70725..70815,72426..72601,72739..72826,72989..73233,74992..75354,75447..75636))
Gene LOC107830878
GeneID 107830878
Organism Nicotiana tabacum

Protein

Length 528aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA319578
db_source XM_016658550.1
Definition PREDICTED: probable Ufm1-specific protease isoform X2 [Nicotiana tabacum]

EGGNOG-MAPPER Annotation

COG_category S
Description Peptidase family C78
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
ko01002        [VIEW IN KEGG]
KEGG_ko ko:K01376        [VIEW IN KEGG]
EC -
KEGG_Pathway -
GOs GO:0003674        [VIEW IN EMBL-EBI]
GO:0003824        [VIEW IN EMBL-EBI]
GO:0006508        [VIEW IN EMBL-EBI]
GO:0006807        [VIEW IN EMBL-EBI]
GO:0008150        [VIEW IN EMBL-EBI]
GO:0008152        [VIEW IN EMBL-EBI]
GO:0008233        [VIEW IN EMBL-EBI]
GO:0008234        [VIEW IN EMBL-EBI]
GO:0016787        [VIEW IN EMBL-EBI]
GO:0019538        [VIEW IN EMBL-EBI]
GO:0019783        [VIEW IN EMBL-EBI]
GO:0043170        [VIEW IN EMBL-EBI]
GO:0044238        [VIEW IN EMBL-EBI]
GO:0070011        [VIEW IN EMBL-EBI]
GO:0071567        [VIEW IN EMBL-EBI]
GO:0071704        [VIEW IN EMBL-EBI]
GO:0140096        [VIEW IN EMBL-EBI]
GO:1901564        [VIEW IN EMBL-EBI]

Sequence

CDS:  
ATGGTTAGCGAAACCCAAGACCCGACTATTCGGATCCTCTGCCGGAGGCTCCAGATTATCAAGAATGAATCGGGTCTTCAATGGCTCATCGGCTCTCCATTCTTCCCTCGCCATACCATTATCTCTACCTTCCGATGTATCCACACTACCCCCTCCAATCCTCTATCTCCGGATTTCTCCAAAGAATCAGATGATATAAGAACACTGCTTCCGAAGGGTTTTGAAGTTATTGGAGCTTTAATTGTCGAAAATGACGGAAATTTGGACAAAATCGCCGGGGAGGCGATTAATGCAGCTTGTAATTTGAAGAAAAGTTTGTCGAGTGATGAAAATTTGGGCAATCTGGAGCTGGTCGGTGCTGTGGTGGATTTAAATAGCGCTAATGATGTTCGTTTTTTCGTGTCGAAGGATGGAAAATTGGGCAGTCTTCAGAGTGTTAGTTCCATTATGTACGAAGACAAGCCTGAAAAGTATATTTGGGAAAGAGGCTGTTTGCTTCGGTGTGCTCTTCATGTAAAATTGCCTCTGTATTATAATACTAGTAACCCTGATGATGTACATGAGATATATATGCGTGCAGCTGAAGCTGTTGCTAGTAAATTCAAAGACCCACAAGTTACTTGCCTAATAGAAGCTTTAGCTGAAACTTCAAGTGGTGCTATCGTTCTTCGTGGTTCAGACCTGAACACATATAGTTCAAATTCTTCCTCTGAACTTAAAGATTCTGATACGAAAGCTTTGTTATGTTCATACTTCTTTTCAATGAGTAAAGATATTACCTCATTTTCTTCAATAGAGAATGCAGATAAAATCCAAGTAAGCTTTCTGCTTAATAAATCAATAAATTCTGCAAAACCTTCTGCACCTATTGCTGAATATTATCCAGCCACCCAGGAAACTGAACTTCTGGTCATAGGCCATAAACTTGAAGTGCTTTGTTATGCGGCAAAAGATCTATCGCTGGCTTATAGCGTATCAAAGTTGGTCATCCCTGCACTACTTGACCAGTTACACTCAATGAGGAAAGTTATCATGCCTGACCTCTTAAAGGGGCATCCGGAGTGGCATCCATATCACTTCCTTCCTCCAGGACTTTTACATCCGATTACAGTCTTGTATGAACTTAGTTATGGTGAGACAGAACTGAAGCAAGTTGAAACAAGAAGATCCCTCCATTTGAGACTTGGGTTACCTTTTGATCGCCCTCTTCTTAGAATTTCAAATGCTATTGATCTAGTAGGGAAGAAGAATACTGGCAGCTCAGTCCAGAAAGGCTCTTCTTTGCTTAAGGATGCACATTTGGGGATTCCATGTAGTGGTGTTTCTGGAGGTGTCTCCTCTCTGGTTCAAGGTTCTTATGAGTACTACCATTACCTCCATGAGGGACTTGATGACTCGGGGTGGGGCTGTGCTTACCGCTCTCTGCAGACAATCATTTCTTGGTTCAAGTTGCAAAATTACACTTCGATTGATGTCCCATCACACAGCTCCCTCTTTTTAAAACAAGAGACTGCTAGATATAAAATGAAAACAAACAGAACACAAGGAGTACCATCTATTGCGCAGAGCAACATAAATCATGGTTAA
Protein:  
MVSETQDPTIRILCRRLQIIKNESGLQWLIGSPFFPRHTIISTFRCIHTTPSNPLSPDFSKESDDIRTLLPKGFEVIGALIVENDGNLDKIAGEAINAACNLKKSLSSDENLGNLELVGAVVDLNSANDVRFFVSKDGKLGSLQSVSSIMYEDKPEKYIWERGCLLRCALHVKLPLYYNTSNPDDVHEIYMRAAEAVASKFKDPQVTCLIEALAETSSGAIVLRGSDLNTYSSNSSSELKDSDTKALLCSYFFSMSKDITSFSSIENADKIQVSFLLNKSINSAKPSAPIAEYYPATQETELLVIGHKLEVLCYAAKDLSLAYSVSKLVIPALLDQLHSMRKVIMPDLLKGHPEWHPYHFLPPGLLHPITVLYELSYGETELKQVETRRSLHLRLGLPFDRPLLRISNAIDLVGKKNTGSSVQKGSSLLKDAHLGIPCSGVSGGVSSLVQGSYEYYHYLHEGLDDSGWGCAYRSLQTIISWFKLQNYTSIDVPSHSSLFLKQETARYKMKTNRTQGVPSIAQSNINHG